• Fault-tolerance for MPI Codes on Computational Clusters 

      Hagen, Knut Imar (Master thesis, 2007)
      This thesis focuses on fault-tolerance for MPI codes on computational clusters. When an application runs on a very large cluster with thousands of processors, there is likely that a process crashes due to a hardware or ...